Picture for Zhang Zhang

Zhang Zhang

PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model

Add code
May 08, 2025
Viaarxiv icon

Occupancy World Model for Robots

Add code
May 07, 2025
Viaarxiv icon

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Add code
May 05, 2025
Viaarxiv icon

Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks

Add code
Apr 28, 2025
Viaarxiv icon

A Call for New Recipes to Enhance Spatial Reasoning in MLLMs

Add code
Apr 21, 2025
Viaarxiv icon

RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots

Add code
Apr 20, 2025
Viaarxiv icon

Open-Medical-R1: How to Choose Data for RLVR Training at Medicine Domain

Add code
Apr 16, 2025
Viaarxiv icon

MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Add code
Apr 07, 2025
Viaarxiv icon

Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration

Add code
Mar 27, 2025
Viaarxiv icon

Aligning Multimodal LLM with Human Preference: A Survey

Add code
Mar 18, 2025
Viaarxiv icon